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(54) Disk control device for disk array systems 

(57) A disk control device (20) is operationally con- 
nected to a high-order device (30) and operates a plu- 
rality of disk devices (32) . A cache (22) is included in 
the disk control device for storing data blocks (34; LB-0 
- LB-19) and parity data (PO; PI ; P2; P3: P4) from ^ plu- 
rality of parity groups (LB-O to LB-S. PO; LB-4 to LB-7. 
PI ; LB-8 to LB-1 1 . P2; LB:12 to LB-15, P3; LB-16 to LB- 
19, P4) in the plurality of disk devices (32). The disk 
control device also includes a cache management table 
(24) for indicating the positions of the data stored in the 
cache. Cache management means (26) is provided for 
transferring selected data requested by the host control 
(30) from the plurality of disk devices (32) to the cache 
(22) when the cache management table (24) indicates 
that said selected data is not stored in the cache (22). 
Control means (28) transfers the selected data from the 
cache (22) to the high-order device (30) when the cache 
management means (26) informs the control means 
(28) that the selected data is stored in the cache (22). 
Moreover, the cache management means (26) holds 
selected parity data (PO; P1 ; P2; P3: P4) in the cache 
when data belonging to the same one of the plurality of 
parity groups (LB-O to LB-3. PO; LB-4 to LB-7. PI ; LB-8 
to LB-1 1 . P2; LB-12 to LB-15. P3; LB-16 to LB-19. P4) 
as the selected parity data is stored in the cache, and 
expels the selected parity data from the cache when no 
data belonging to the same one of the plurality of parity 
groups as the selected parity data is stored in the 
cache. 
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Description 

[0001 ] TTiis invention relates generally to a disk control device for disk array systems and more particularly, to a disk 
control device adapted to be used with RAID-4 and RAID-5 disk systems for increasing the data access speed by 
5 reducing the number of accesses by the disk control device to the disk devices. 

BACKGROUND OF THE INVENTION 

[0002] Disk array systems devices which allow great volumes of data to be transferred in and out and from many disks 
10 at high transfer speed and with relatively good reliability have been in use for many years. These disk array systems 
were classified as Redundant An-ays of Inexpensive Disks (RAID) 1 to 5 by Dr. David A. Patterson et al. of the University 
of California (see ACM SIGMOD Conference, Chicago. Illinois, June 1-3, 1988, R109-166). The present invention 
relates for example to RAID-4 and RAID-5. which will now be discussed. 

[0003] It is known in the art that the reliability of systems such as the RAlD-4 and the RAID-5 systems is provided by 
IS parity or checking or redundant data which is created or derived from a combination of "informational" data to which the 
parity or checking or redundant data refers, e.g. data stored in the same memory block location of each of the disk 
devices. Herein, such parity or checking or redundant data, whether created in accordance with the RAID-4 or RAID-5 
systems or by other means or other protocols, will be referred to simply as parity data. The data which creates the parity 
data is fomied into a group called a parity group The parity data of each parity group is stored in each of the disk 
20 devices in the anay and is Included In the parity group. 

[0004] One of the characteristics for example of the RAID-4 and the RAID-5 systems is that each time data of a par- 
ticular parity group is altered, the parity data of that parity group is recalculated and updated so that the redundancy of 
the parity data is maintained. This requires the disk control device to not only access the memory block of a disk con- 
taining the desired data, but also the memory block of the disk containing the corresponding parity data, each time the 
25 data is accessed from the disk anray Consequently, this causes a delay in operatfon. since the disk control device for 
the RAID-4 and the RAID-5 systems must always access an additional memory block each time data is accessed. In 
other words, the inherent characteristics for example of the RAID-4 and the RAID-5 systems result in reduced access 
speed during the reading and writing process. Thus, there is a need for a disk control device, for exanple for the RAID- 
4 and the RAID-5 systems, which is able to increase the access speed. 

30 

SUMMARY OF THE INVENTION 

[0005] According to the present invention there is provided a disk control device operationally connected to a high- 
order device for operating a plurality of disk devices, and having a cache for data blocks and associated parity data from 
35 a plurality of parity groups in the plurality of disk devices, said disk control devfoe comprising: 

a cache management table for indicating positions of the data blocks stored in the cache; 
cache management means for transfenring a selected data block requested by the high-order device from the plu- 
rality of disk devices to the cache when said cache management table indicates that said selected data block is not 
40 stored in the cache; and 

control means for transf en-ing said selected data block from said cache to said high-order device when said cache 
management means informs said control means that said selected data block is stored in the cache; 

wherein said cache management means holds selected parity data in the cache when at least one data block 
45 belonging to the same one of the plurality of parity groups as said selected parity data is stored in the cache, and expels 
said selected parity data from the cache when said at least one data block belonging to the same one of the plurality of 
parity groups as said selected parity data is not stored in the cache. 

[0006] An embodiment of this invention can provWe a new and improved disk control device for example for the RAID- 
4 and the RAID-5 systems. 

so [0007] An embodiment of this invention can provide a new and Improved disk control device having increased access 
speed, for example for use with the RAlD-4 and the RAlD-5 systems. 

[0008] An embodiment of this invention can provkJe a new and improved disk control device for example for the RAID- 
4 and the RAID-5 systerns which does not access the parity data each time the nonparity data is accessed. 
[0009] Further, an embodiment this invention can provide a new and improved disk control device for example for the 
55 RAID-4 and the RAID-5 systems which holds the parity data in a cache until all the nonparity data of the corresponding 
parity group is transferred out of the cache. 

[0010] In keeping with one embodiment of this invention, a disk control device is operationally connected to a high- 
order device and operates a plurality of disk devices. A cache Is included in the disk control device for storing data 
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including a plurality of parity data from a corresponding plurality of parity groups in the plurality of disk devices. TTie disk 
control device also includes a cache management table for indicating the positions of the data stored in the cache. 
Cache management means is provided for transferring a selected data request by the host control from the plurality of 
disk devices to the cache when the cache management table indicates that said selected data is not stored in the 
5 cache. Control means transfers the selected data from the cache to the high-order device when the cache management 
means Informs the control means that the siBlected data is stored in the cache. Moreover, the cache management 
means holds selected parity data in the cache when data belonging to the same one of the pluralfty of parity groups as 
the selected parity data are stored in the cache, and expels the selected parity data from the cache when no data 
belonging to the same one of the plurality of parity groups as the selected parity data is stored in the cache. 

10 

BRIEF DESCRIPTION OF THE DRAWINGS 

[001 1] The above-mentioned and other features of embodiments of this invention and the manner of obtaining them 
will become more apparent, and will be further understood by reference by way of example to the following description, 
75 taken in conjunction with the accompanying drawings in which: 

FIG. 1 is a block diagram of a disk control device in accordance with the principle of the present invention; 
FIG. 2 is a diagram showing the configuration of one type of disk array system; 
FIG. 3 is a table used in converting a logical address into a physical address for the disk array of FIG. 2; 
20 FIG. 4 is a block diagram of a cache management section of the disk control devrce of FIG. 1 in accordance with 
the principle of this invention; 

FIG. 5 is a pointer table in a cache management table of the disk control device off FIG. 1 for indicating the top and 
the bottom entries of the data stored in a cache; 

FIG. 6 is a LRU link table In the cache management table for showing the data stored in the cache and the order in 
25 which these data were entered in the cache; 

FIG. 7 Is an entry table in the cache management table for showing the parity groups of the data stored in the 
cache; 

FIG. 8 is a flowchart illustrating the operation of the control section of the disk control device of FIG. 1 ; 
FIG. 9 is a flowchart illustrating the operation of the cache management section; 
30 FIG. 10 is a flowchart illustrating the detailed process for transfemng data from the disk devices to the cache: 
FIG. 11 is a flowchart illustrating the detailed process for expelling data from the cache; 
FIG. 12 Is a flowchart illustrating the operations of the control section and the cache management section in read- 
ing data from the cache when the data is present in the cache; 

FIG. 13 is a flowchart illustrating the operations of the control section and the cache management section in read- 
35 ing data from the cache when the data is not present in the cache; 

FIG. 14 is a pointer table in the cache management table for indicating the top and the bottom entries of the parity 

groups stored in the cache in accordance with an alternate principle of the present invention; 

FIG. 1 5 is a LRU link table in the cache management table for indicating the order off entries of the parity groups in 

the cache in accordance with the alternate principle off the present invention; 
40 FIG. 16 is an entry table in the cache management table for indicating the parity groups stored in the cache in 

accordance with alternate principle of the present invention; and 

FIG. 17 is a flowchart illustrating the operation of the control device for transferring the target data from the disk 
devices to the cache in accordance with the alternate principle of the present invention. 

45 DETAILED DESCRIPTION 

[0012] As seen in FIG. 1 , a disk control device 20 includes a cache 22, a cache management table 24. a cache man- 
agement section 26 and a control section 28. The disk control devtoe 20 is operationally connected between a high- 
order device 30 and five disk devices 32 (lat^eled Disk 0-Disk 4). each of which stores a fixed length of data which are 

so accessible to the high-order device via the control device. The number of disk devices 32 connected to the control 
device 20 can vary as required. Five disk devices are shown in FIG. 1 as a typical number. 
[0013] The cache 22 at various times stores portion of data stored in the disk devices 32 as required by the high-order 
device 30. and the cache management table 24 keeps track of these data in the cache including their locations, the 
order of their entry into the cache 22 and the information about their respective parity groups. The cache management 

55 section 26 transfers data between the disk devices 32 and the cache 22 by referring to the cache management table 
24. Transfer of data between the high-order device 30 and the cache 22 is performed by the control section 28, as 
authorized by the cache management section 26. 

[0014] As shown in FIG. 2, each of the disk devices 32 has five memory blocks 34 (labeled Block 0 to Block 4). four 
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of which store the transfer or Informational data, while one stores the parity or redundant data, which is created in 
known manner from the combination of the transfer data stored in the other four disk devices. Having this configuration, 
any data In the disk devices 32 can be identified with a disk device number and a block number, i.e.. the physical 
address. 

5 [0015] Most conventional high-oider devices 30 recognize the array of disk devices 32 to which it is connected as 
being a single large-capacity disk device, regardless of the number of disk devices that are actually connected to the 
high-order device. To be compatible with the high-order device 30. the disk control device 20 operates as if it is con- 
nected to a single, virtual disk device having virtual memory blocks. In accessing a particular memory block 34 in the 
disk devrces 32. the high-order device 30 specifies the virtual disk device number of the data and the virtual block 

10 number of the data it wishes to obtain, i.e., the logical address. 

[001 6] When the logical address is specified by the high-order device 30, the disk control device 20 converts the log- 
ical address into a physical address including the physical disk device number and the physical block number, i.e.. the 
number of the disk device 32 and the memory block 34. Conversion from the logical address to the physical address is 
accomplished in accordance with the number of the disk devices 32 connected to the control device 20. The desired 

IS physical address is obtained from a conversion table 35 shown in FIG. 3, using values A-C. which are calculated as fol- 
lows: 

Value A: Remainder of (Logical address/(Number of physical disk devices- 1)} 
20 Value B: Value of {Logical address/(Number of physical disk devices - 1)} after the fraction is discarded 
Value C: Remainder of (Value B/Number of physical disk devices] 

[0017] If, for example, five disk devk:es 32 are connected to the disk control device 20 as shown in FIG. 1 , and the 
25 logical address specified by the high-order device 30 Is 11 . the values A-C are obtained as follows: 

Value A: Remainder of {1 1/(5 - 1)} = 3 

Value B: Value of {1 1/(5 - 1)} after the fraction is discarded ^ 2 

30 

Value C: Remainder of (Value 8/5) = 2 

As shown in the conversion table 35 of FIG. 3, the values A =3 and C=2 indicate that the physical address oon-espond- 
ing to the logical address 1 1 is disk 4, block 2. 
35 [0018] Turning back to FIG. 2, and examining the same block number in each of the five disk devices 32. it is shown 
that the transfer data is stored in only four of the five dsk devices at that block number. The remaining one of the five 
disk devices 32 stores the parity data, which is created from the transfer data of the other four disk devices in the same 
block number. 

[001 9] The disk device 32 In which the parity data is stored Is determined by value D which is calculated as follows: 

40 

Value D » (Number of physical disk devices - 1 - Value C) 

Assuming as atK>ve that the logical address is 1 1 and the number of disk devices Is five, the parity data is stored in disk 
2, i.e., Value D=(5-1-2)=2, which is consistent with the diagram of FIG. 2. 

45 [0020] The parity data together with the transfer data from which the parity data is created form a parity group. For 
example, as shown in FIG. 2, parity data PO is created from transfer data at physical addresses Disk 0/Block 0, Disk 
1/Block 0, Disk 2/Block 0 and Disk 3/Block 0 and stored at physical address Disk 4yBlock 0. These five blocks form a 
parity group. In the RAID-5 system, parity data PI created from data in Disk 0/Block 1 , Disk 1 /Block 1 , Disk 2/Block 1 
and Disk 4/Block 1 are stored in Disk 3/Block 1 . In the RAlD-4 system, the parity data Is always stored in Disk 4. 

so [0021 ] As shown in FIG. 4, the cache management section 26 includes a microprocessor unit (MPU) 36 which is oper- 
ationally connected to a memory 38. The MPU 36 is also connected to the control section 28, the disk devices 32 and 
the cache management table 24. The memory 38 includes a parity registration judging unit 40, a group registration 
judging unit 42, a data expulsion unit 44, a data registration unit 46. a least recently referenced data (LRU) control unit 
48. Each of these units or microprograms 40, 42, 44, 46, 48 is connected to an operation determination unit 50, which 

55 is also included in the memory 38. 

[0022] The cache management tat)le 24 provides information regarding data stored in the cache 22. and includes a 
pointer table 52, a link table 54 and an entry table 56, respectively shown in FIGS. 5-7. The pointer table 52 shows the 
logical address of the top (last) and the bottom (first) data that are stored in the cache 22. The link table 54 shows the 
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LRU "link" status or the order in which the data from the cache 22 are stored or registered in the cache 22. An "Entry" 
column of the link table 54 indicates the logical address of the stored data, and the link or relationship between each 
entry is shown by the logical addresses immediately before and after a selected logical address. For example, the log- 
ical address in the "Fonward" column of a selected logical address in the "Entry" Indicates the logical address of the data 

5 which is stored or registered after the data of the selected logical address, and the logical address in the "Backward" 
column indicates the data which was registered prior to the registration of the data of the selected logical address. 
[0023] The entry table 56 of FIG. 7 includes a "Target Block" column, a "Bit Map" column, four "Block" columns and 
a "Parity" column. Each of the entry lines (lines labeled 0-4 shown in FIG. 7) holds information regarding the data in the 
cache 22 belonging to the same parity group. TTie values in the "Bit Map" column consist of four bits which correspond 

10 to the lour "Block" columns, and indicate the status of the data in each of the four data blocks (0 Block. 1 Block, 2 Block 
and 3 Block) in the parity group. A bit set to "1" indicates that the data belonging to the con-esponding block in the parity 
group is registered in the cache 22. and a bit set to "0" indicates that the data in the corresponding block in the parity 
group is not registered in the cache 22. For example, the bit map of entry line 0 is set to "0001" in the entry table 56 of 
FIG. 7. indicating that the data belonging to 3 Block (the fourth block) of the parity group is registered in the cache 22 

75 and that.the data belonging to 0 Block, 1 Block and 2 Block are not stored in the cache. The corresponding logical 
addresses (denoted "LB") of the data blocks 0-3 of the respective parity groups are listed in the Target Block" column 
of the entry table 56. 

[0024] The operation of the control section 28 is now described while referring to the flowchart of FIG. 8. At the start 
(800), the control section 28 waits for an access from the high-order device 30 or the cache management section 26 
20 (S801). If no access is detected, the process goes back to step (S801). However, if an access is detected (S802), the 
control section 28 judges whether the access is from the high-order device 30 or from the cache management section 
26(S803). 

[0025] If the access is from the high-order device 30, the control section 28 inquires of the cache management section 
26 as to whether the target data requested by the high-order device is registered in the cache 22 (S804), and then waits 
25 for a response from the cache management section (S805). Once a response is detected (S806), the control section 
28 analyses the response to determine whether the target data is registered (a "cache hit") or not registered in the 
cache 22 (a "cache miss") (8807). 

[0026] tf the result is a cache hit. the control section 28 transfers the target data stored in the cache 22 to the high- 
order device 30 (S808). The storage address of the target data is provided by the cache management section 20 to the 
30 control section 28. This finishes the process performed by the control section 28 (S809), and the control section again 
warts for an access (S801 ). 

[0027] If the resuK Is a cache miss at step (S807), however, the control section 28 goes to step (S801) and waits for 
the target data to be stored in the cache 22 from the disk devices 32 by the cache management section 26 (S810). Dur- 
ing this wait, the control section 28 can execute a completely different process. If an access is detected during the wait 
35 period, the control section 28 judges ¥vhether the access is from the high-order device 30 or from the cache manage- 
ment section 26 (S803). 

[0028] If the access is from the cache management sectton 26. the target data is stored in the cache 22 and trans- 
ferred from the cache to the high-order device 30 (S808). This ends the process (S809), and the control section 28 
again waits for an access from either the high-order device 30 or the cache management sectton 26 (S801 ). 

40 [0029] The operation of the cache management section 26 is now described with reference to the flowchart of FIG. 
9. At the start of the operation (S900). the cache management section 26 waits for a communication from the control 
section 28 (S901). In this state, the MPU 36 of the cache management section 26 functions in accordance with the 
directions from the operation determination unit 50. The operation determination unit 50 detects communications from 
the control section 26, determines the process to be performed by the cache management section 26, and activates 

45 necessary units 40-48 in the memory 38 for performing various functions. 

[0030] If a communication from the control section 28 is detected (S902), the operation of the cache management 
section 26 is passed to the control of the LRU control unit 48 as directed by the operation determination unit 50. The 
LRU control unit 48 determines whether the target data required by the control section 28 is registered in the cache 22 
by checking the link table 54 (see FIG. 6) of the cache management table 24 (S903). If the entry for the target data is 

so found in the link table 54 (a cache hit), the LRU control unit 48 notifies, and returns the process control to, the operation 
determination unit 50 (S904), which then notifies of the existence of the target data in the cache 22 to the control section 
28 by providing it with the storage address of the data (S905). The operation of the cache management section 26 is 
terminated at this point (S906). 

[0031 ] If at step (S903) the entry for the target data is not detected In the link table 54 (a cache miss), the LRU control 
55 unit 48 notifies the operation determination unit 50 of its finding and also returns the process control to the operation 
determination unit 50. The operation determination unit notifies the control section 28 of the cache miss condition 
(3907), and initiates a process to transfer the target data from the disk devices 32 into the cache 22 (S908), and waits 
(S909). After the target data is transfen-ed into the cache 22, the cache management section 26 notifies the control sec- 
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tion 28 that the data transfer has been completed by providing the storage address of the target data in the cache 22 
(S910). This finishes the process by the cache management section 26 (S91 1). 

[0032] The process for transferring the target data from the disk devices 32 to the cache 22 discussed in step (S908) 
of FIG. 9 is described in more detail with reference to the flowchart of FIG. 10. If the data requested by the high-order 
5 device 30, i.e., the target data, does not exist In the cache 22, as in step (S903) of FIG. 9. the operation determination 
unit 50 activates the data registration unit 46. 

[0033] At the start of the data registration process (SI 000) in which the target data from the disk devices 32 is trans- 
ferred to the cache 22. the data registration unit 46 checks to determine whether the cache 22 has a space for storing 
new data, and whether the link table 54 has a space for a new entry (S1 00 1 ). If the cache 22 and the link table 54 have 
10 no space available, the new data cannot be registered. In this situation, the data registration unit 46 passes the process 
control to the data expulsion unit 44 to create as space in the cache 22 for the new data by expelling an existing regis- 
tered data from the cache (Si 002). This process is described in detail below with reference to the flowchart of FIG. 1 1 . 
After making an area available in the cache 22, the data expulsion unit 44 returns the process control to the data reg- 
istration unit 46. 

75 [0034] In accordance with one important aspect of the present invention, the parity data for a particular parity group 
is maintained in the cache 22 once it is initially stored in the cache, and not transferred back to the disk devices 32, as 
long as at least one data from the same parity group is also stored in the cache. In this manner, the parity data is not 
transferred back and forth between the cache 22 and the disk devices 32 each time data belonging to the same parity 
group as the parity data is transfen-ed between the disk devices and the cache, thereby reducing disk access time. 

20 [0035] To accomplish the above-described feature for reducing the disk access time, the data registration unit 46 acti- 
vates the group registration judgment unit 42, which checks theTarget Block" column of the entry table 56 In the cache 
management table 24 to determine whether the parity group of the target data is entered in the entry table {S1003). In 
the present invention, the first time data from a particular parity group is transferred to the cache 22, an entry Is made 
in the entry table 56 for the parity group as a whole, including the parity data. For example, if the logical address of data 

25 block 3 of a parity group is LB-3 and this data block is the first from Its parity group to be stored in the cache 22, the 
logical addresses (LB-0. LB-1 and LB-2) of the other data blocks in the parity group are also entered in the Target 
Block" column (see entry 0. the first entry line, in the entry table 56). This is the case whether or not the actual data of 
the logical addresses LB-0. LB-1 and LB-2 are stored In the cache 22. 

[0036] Therefore, detection of any of the logical addresses of the data blocks of a parity group in the "Target Stock" 
30 column of the entry table 56, indicates that the parity data for that parity group exists In the cache 22. In the preferred 
embodiment, the group registration judgement unit 42 checks the Target Block" column for the logical address of the 
target data block to detenmine whether the parity data exists in the cache 22. After giving notification of the detection 
result to the data registratton unit 46, the group registration judgment unit 42 returns the process control to the data reg- 
istration unit (SI 004). 

35 [0037] If the logical address of the target data block is not detected in the entry table 56, the data registration unit 46 
reserves two blocks of area in the cache 22, one for registering the new data and the other for registering the parity data 
of the parity group to which the new data belongs (SI 005). The data registration unit 46 then records the storage 
address (location) of the reserved blocks in the corresponding "Block" column and the "Parity" column of the entry table 
56 (S1006). The data block positions (i.e., data blocks 0-3 corresponding to the logical address LB) of the new data 

40 within the parity group is calculated, and a bit map indicating the block position is aeated (SI 007). For exanple, if the 
data block position of the new data is block 3, the bit map wouM be set to 0001 . The bit map is entered in "Bit Map" 
column of the entry table 56 (SI 008) to complete the registration of the new data into the entry table. 
[0038] If at step (SI 003), the logical address of the target data is delected in the "Target Block" column of the entry 
table 56, the data registration unit 46 resen/es only one block of area in the cache 22 for registering the new data 

45 (SI 009). The data registration unit 46 then records the location of the reserved block in the entry table (SI 01 0). and the 
data tAock position of the new data within the parity group is calculated. Then the corresponding bit in the bit map, which 
is already in the "Bit Map" column of the entry table 56. Is set to "1" to finish updating an existing parity group entry in 
the entry table (S1011). 

[0039] Regardless of whether or not the parity data is stored in the cache 22, the pointer table 52 and the link table 
so 54 of FIGS. 5 and 6. respectively, are updated after the bit map is updated. As such, the logical address of the new data 
is entered as the top entry for the link table 54 and the pointer table 52 (SI 01 2). In the link table 54, the logical address 
of the new data is entered In the "Fonward" column of the fomier top entry so that the new top entry Is linked with the 
previous entries (SI 01 3). Lastly, the data registration unit 46 instructs the disk device 32 containing target data and. if 
necessary, the one containing the parity data, to transfer the data to the cache 22 (S1014). This ends the process for 
55 transferring data from the disk devices 32 to the cache 22 (SI 01 5). and the process control is returned to the operation 
determination unit 50 at step (S908) of FIG. 9. 

[0040] It should be noted that while the pointer and the link tables 52, 54 are described above as being updated before 
the new data actually is transferred into the cache 22, these tables may also be updated after the new data has been 



6 



EP0936533 A2 



transferred to the cache. 

[0041] The process in which the registered data is expelled from the cache 22 to create a space for the new data, as 
briefly described above in step (S1002) of FIG. 10, is described in more detail with reference to the flowchart of FIG. 
1 1. As described above, if the cache 22 does not have sufficient space for the new data, the data registration unit 46 
5 passes the process control to the data ecputsion unit 44 to create a space in the cache 22 for the new data by expelling 
the existing registered data from the cache. At the start of the expulsion process (S1 100), the data expulsion' unit 44 
references the pointer table 52 (shown in FIG. 5) to obtain the logical address of the data at the bottom of the LRU link, 
i.e., the data least recently referenced. The data identified by this logical address is then expelled from the cache 22 

(51 101) . If, for example, the logical address at the bottom of the LRU link is 3, as shown in the link table 54 of FIG. 6. 
10 the data con^esponding to this address would be expelled. 

[0042] The data expulsion unit 44 then checks the link table 54 for the entry of the logical address of the expelled data 

(51 102) . At step (S1 103), if the entry of the expelled data block is not found, the process goes back to step (S1 103). 
However, if that entry is found (S1 103), the fbnvard pointer value (FPV), which indicates the logical address of the data 
that was transferred to the cache 22 immediately after the expelled data, is obtained from the Tonward" column of the 

IS entry of the expelled data. Again assuming that the data of logical address 3 has been expelled, the fonward pointer 
value is 1 8 in the entry table 54 of FIG. 6. Logical address 1 8 corresponding to the fonward pointer value is accordingly 
positioned at the bottom of the LRU link (S1 104). 

[0043] The data expulsion unit 44 checks the link table 54 again for the entry of the logical address (18) newly posi- 
tioned at the bottom of the LRU link (S1 1 05). At step (S1 1 06), if the entry is not detected, the process goes back to step 

20 (S1 105), but if it is detected, the backward pointer value (BPV) in the "Backward" column of that entry is substituted with 
an invalid value (XX) (S1 107). As shown in FIG. 6, value 3 in the "Backward" column of the entry line of logical address 
18 is replaced with invalid value XX. since the entry of logical address 18 is now at the bottom of the LRU link. Subse- 
quently, the entry of the expelled data in the link table 54 is replaced with an invalid value (S1 108), and the logical 
address (3) of the expelled data in the pointer table 52 is replaced with the new logical address (18) at the bottom of the 

25 LRU link (S1 109). 

[0044] With respect to the entry table 56 of FIG. 7, the data expulsion unit 44 retrieves the logical address of the 
expelled data from the "Target Block" column of the entry table (Si 1 1 0). At step (S1 1 1 1 ), if the logical address cannot 
be located the process goes back to step (S1 110). After the logical address is detected, the data expulsion unit 44 
updates the bit map corresponding to the parity group in which the expelled data belongs. The bit map is located in the 
30 "Bit Map" column of the entry table 56. and only the bit corresponding to the expelled data is reset from a "1" to a "0" 
(S1112). 

[0045] At this point, the data expulsion unit 44 activates the group registration judgment unit 42 to determine whether 
there are any other data stored in the cache 22 that belong to the same parity group as the expelled data. The group 
registration judgment unit 42 performs this function by looking at the bit map corresponding to the parity group to deter- 
35 mine whether all the bits in the bit map are O's (S1 1 1 3). If all the bits are 0*s, then no data is stored in the cache 22 that 
belongs to the same parity group as the expelled data. However, if the bit map has at least one "1 then there are data 
stored in the cache 22 which belong to the same parity group as the expelled data. 

[0046] If all of the bits in the bit map are O's, the data expulsion unit 44 expels the parity data from the cache 22 
(S1 114). and records an invalid value in the Target block" column of the entry of the expelled data to delete the entry 

40 (S1 115). The data expulsion process is then ended (S1 116), and the data registration unit 46 continues with the proc- 
ess for transferring the target data from the disk devices 32 to the cache 22 at step (SI 002) of FIG. 10. 
[0047] However, if there are other data in the cache 22 that belong to the same parity group as the expelled data, the 
data expulsion unit 44 updates only the corresponding bit map as explained above, the parity data is held in the cache 
and the entry is not deleted from the entry table 56. The data expulsion process is ended at this point (S1 1 16), and the 

45 data registration unit 46 continues with the process for transferring the target data from the disk devices 32 to tiie cache 
22 at step (SI 002) of the flowchart of FIG. 10. 

[0048] In the preferred embodiment of the present invention, the cache management section 26 includes the MPU 36 
and the memory 38, and each of the functions described above is performed in accordance with the instructions from 
the microprograms stored in the memory It is contemplated, however, that the cache management section 26 may also 

so use hard-wired logic to realize functions similar to those described above. 

[0049] The interaction of the control section 28 with the cache management section 26 in performing the general oper- 
ation of the disk control device 20 is described below with reference to FIGS. 12 and 13. When the high-order device 
30 requests data from a particular logical address, the control section 28 (S1200) sends a signal to the cache manage- 
ment section 26 to obtain the requested data from the cache 22 (S1201). The cache management section 26 (S1202) 

55 checks the link table 54 in the cache management table 24 to determine whether the target data exists in the cache 22 
(SI 203). 

[0050] Assuming, for example, that the target data requested by the high-order device 30 is in logical address 13. 
which is listed in the fourth entry line of the link table 54. it is determined that the target data is stored in the cache 22. 
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Subsequently, the cache management section 26 checks Target Block" column of the entry table 56 for the logical 
address (LB-13) that corresponds to the target data (S1204). As shown in FIG. 7, logical address 13 of the target block 
is in entry line 3 and includes logical address LB12 to LB15. In addition to checking the link table 54 as in step (S1203). 
the cache management section 26 also looks at the "Bit Map" column of the entry table 56 to double check whether the 

5 target data of the logical address 1 3 is stored in the cache 22, which is the case In the instant example as shown in the 
FIG. 7 (see entry line 3. column labeled "1 Block"). The cache management section 26 then notifies the control section 
28 of the existence of the target data in the cache 22 by providing the data storage address of the target data in the 
cache 22 (Si 205). This finishes the operation of the cache management section (S1206). The operation of the control 
section is finished (SI 208) after the control section 28 retrieves the target data from storage address in the cache 22 

10 specified by the cache management section 26. and sends the target data to the high-order device 30 (S1207). 

[0051] In the above description regarding the overall operation between the. control section 28 and the cache man- 
agement section 26, it was assumed that the target data requested by the high-order device 30 exists in the cache 22. 
A description will now be given with reference to FIG. 1 3 while assuming that the target data does not exist in the cache 
22. When the high-order device 30 requests data from a particular logical address, the control section 28 at the start of 

15 this process (SI 300) sends a signal to the cache management section 26 to obtain the target data from the cache 22 
(S1301). In response, the cache management section 26 (S1302) checks the link table 54 in FIG. 6 for the logical 
address of the target data. If the logical address specified by the high-order device 30 is 11 , for example, and the link 
table 54 Is as shown In FIG. 6, logical address 1 1 would not be found in the link table. At this point, it is determined that 
the target data does not exist in the cache 22 (SI 303). 

20 [0052] The cache management section 26 notifies the control section 28 that the target data does not exist in the 
cache 22 and that It has started the process to read the data from the disk devices 32 into the cache (Si 304). Upon 
receiving this notification from the cache management section 26, the control section 28 interrupts the operation for 
processing the data request from the high-order device 30 until the cache management section 26 stores the target 
data in the cache (SI 305). Then the control section 28 executes a different process until the target data is stored in the 

25 cache 22. i^eanwhile, the cache management section 26 performs the process to read the target data into the cache 
22 (SI 306). which is described above with reference to FIGS. 10 and 11 and further described below. 
[0053] After the target data, and also the parity data if necessary, Is transferred to the cache 22 from the disk devices 
32, the cache management section 26 notifies the control section 28 of the completion of the transfer process by pro- 
viding the data storage address of the target data in the cache (SI 307). The data request operation for the cache man- 

30 agement section 26 is ended at tills point (S1308). 

[0054] When the completion notice is received from the cache management section 26, the control section 28 inter- 
rupts or terminates ttie process that it was performing (SI 309), and supplies the data from tiie storage address pro- 
vided by tiie cache management section 26 to the highK)rder device 30 (SI 310). The data request operation for ttie 
control section 28 is ended at this point (S131 1). 

35 [0055] The step (SI 306) described above for reading ttie target data into the cache 22 requires converting the logical 
address received from ttie control section 28 into a physical address. For exanple; if the logical address Is 1 1 , the phys- 
ical address becomes Disk 4yBlock 2 since values A, B and C would be 3. 2 and 2, respectively, using the equations 
given above for obtaining ttiese value. Using the physical address, the cache management section 26 locates ttie disk 
device 32 and ttie block number where the target data is stored. The target data Is ttien read into ttie cache 22. If ttie 

40 parity data for the parity group of ttie target data does not already exist in the cache 22. ttie parity data is also read Into 
the cache as described above in reference to step (S1005) of the fkiwchart of FIG. 10. The disk device 32 that stores 
the parity data is determined from ttie equation given above for obtaining value D. If tiie k3gical address of ttie target 
data is 1 1 . ttien value D would be 2 and ttie parity data would be in Disk 2. 

[0056] After ttie physical address is calculated and before ttie target data is read into the cache 22. the cache man- 
45 agement section 26 reserves an area in the cache for the target data. This process is described above with reference 
to flowcharts of FIGS. 10 and 1 1 , and again described below using logical address 1 1 as an example. To reserve an 
area in the cache 22. the cache management section 26 first checks the link table 54 for a blank entry (see step (SI 001 ) 
in the flowchart of FIG. 10). If there Is no blank entry line, the cache management section 26 expels the least recently 
referenced data from the cache 22, i.e., the data at the bottom of ttie LRU link (see step (SI 002) In the flowchart of FIG. 
so 10 and ttie flowchart of FIG. 1 1). However, if the link table 54 is as shown In FIG. 6. for example, ttiere is an unused 
entry line (the last line of the link table 54). The logical address of the new data is entered in this unused line by ttie 
cache management section 26. 

[0057] If the link table 54 permits a new entry, either as a result of ttie least recently referenced data being expelled 
or unused area being available, the cache management section 26 reserves one block of area In the cache 22 for stor- 
55 ing the target data. Assuming tfiat the parity group of ttie target data is not registered In the cache 22, one additional 
block of area is reserved in ttie cache 22 for storing tiie parity data because the enfry table 56 wouW not already store 
the parity data. Thus, the cache management section 26 reserves two blocks of area in tiie cache 22, and records ttie 
allocation information into ttie entry table 56 (best seen in FIG. 7). 
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[0058] Recording the information regarding the reserved blocks involves errtering the logical addresses LB-8 to LB- 
1 1 in the "Target Block" column of an open entry line in the entry table 56, since as shown In FIG. 2, logical addresses 
LB-8 to LB-1 1 belong to the same parity group. The bit map for the new entry is set to "0001 since LB-1 1 corresponds 
to the fourth data block of the parity group and the fourth bit of the bit map. The bit map is recorded in the "Bit Map" 
5 column of the entry table 56, which completes the registration of the new entry In the entry table by the cache manage- 
ment section 26 (see steps (S1007) and (S1008) of the flowchart of FIG. 10). 

[0059] The cache management section 26 then updates the link table 54 (seen in FIG. 6) by entering new logical 
address 1 1 in the "Entry" column of the unused entry line and entering an invalid value (XX) in the "Fonward" column as 
the fonvard pointer value (FPV) and the logical address of the former top entry (13) in the "Backward" column as the 

10 backward pointer value (BPV) for the new top entry 1 1. Then the cache management section 26 changes the value in 
the "Fonward" column of the former top entry (1 3) from the invalid value (XX) to the logical address of the new top entry, 
which is 1 1 . The cache management section 26 also changes the value in the "Top" column of the pointer table 52 from 
1 3 to 1 1 to indicate that data of logical address 11 is at the top of the LRU link among the data stored in the cache 22. 
[0060] After the link and the pointer tables 52, 54 are updated, the cache management section 26 respectively records 

IS the data storage addresses of logical address 1 1 and the parity data in the corresponding "3 Block" and "Parity" col- 
umns of the entry table 56. Then the cache management section 26 instructs the disk device 32 (Disk 4/Block 2) con-e- 
sponding to logical address LB-1 1 and the disk device 32 (Disk 2/Biock 2) which holds the parity data (P2) to transfer 
the data to the storage addresses recorded in the "3 Block" and the "Parity" columns. After the target data and parity 
data are transferred to the cache 22. the cache management section 26 notifies the control section 28 of the completion 

20 of the data transfer process. 

[0061] In the above-described operation of the invention, the target data is transfen'ed individually from the disk 
devices 32 to the cache 22. In accordance with another aspect of the present invention, the target data is obtained by 
transferring parity group of the target data from the disk devices 32 to the cache 22 in its entirety. In other words, the 
entire parity group of the target data is treated as the target data itself. Accordingly, the numbers in the pointer table 58 

25 and the link table 60 of FIGS. 1 4 and 1 5 designate the entry order numbers of the parity groups In the cache 22, in con- 
trast to the pointer and the link tables 52, 54 of FIGS. 5 and 6 which numbers designate the logical address of each data 
stored in the cache 22. The entry order numbers in the pointer and the link tables 14, 15 correspond to the entry line 
numbers in the "Entry" column of the entry tatrie 62 shown in FIG. 16. 

[0062] The entry table 62 consists of the "Entry" column, the "Target Block" column, four "Block" columns that store 
30 the cache storage addresses and the "Parity" column. The parity group of the target data are registered in the cache 22 

and added to the entry table 62 In the order of registration. For example, the entry numbers are allocated to the parity 

group in ascending order. The entry table 62 of FIG. 16 shows that the parity groups were registered in the order of LB- 

0 : LB-3. LB-1 6 - LB-1 9, LB-4 - LB-7, and LB-1 2 - LB-1 5 and have the entry numbers 0 to 3, respectively. 

[0063] In operation, the control device 20 functions the same as described atx>ve with reference to the flowchart of 
35 FIG. 12, when the target data requested by the high-order device 30 exists in the cache 22. i.e., when there is a "cache 

hit," (see the fkiwchart of FIG. 12). Accordingly, no additional description is given regarding the operation of the control 

device 20. 

[0064] When the target data is not stored in the cache 22, the operation is still the same as described above with ref- 
erence to the flowchart of FIG. 13, but only up to step (SI 305). The process in which the target data is read into the 
40 cache 22, as in step (S1306) of the flowchart of FIG. 1 3, is different than in the first aspect and is described below with 
reference to FIG. 17. 

[0065] Initially, the cache management section 26 converts the logical address received from the control section into 
a physical address using the formulas given above for calculating values A, BandC(S1701). For example, if the high- 
order device 30 request data from logical address 1 1 . the physical address becomes Disk 4/Block 2, because values 
45 A, B and C would be 3, 2 and 2, respectively. The cache management section 26 then checks the entry table 62 for a 
blank entry (SI 702). 

[0066] If a blank entry line Is available, the cache management section 26 reserves five blocks of area in the cache 

22, one block for the target data and the remaining four for the the parity data and the non parity data which are in the 
same parity group as the target data (SI 703). Even when the high-order device 30 requires only one block of data, for 

50 example, the target data at logical address LB-11, four more blocks are reserved of the remaining data at logical 
addresses LB-8, LB-9. LB-10 and parity data P2, which belong to the same parity group as the target data. Aftenwards, 
the cache management section 26 allocates the five reserved blocks of area in the cache 22 to the logical blocks in the 
parity group, i.e., to the four "Block" columns and the "Parity" column in the entry table 62 (SI 704). Then the cache man- 
agement section 26 records the information regarding the parity group in the entry table 62 to complete the registration 

55 of the new entry into the entry table, such as entering logical addresses of the data in the "Target Block" column and 
entering the storage address of the parity data and the nonparity data including the target data in the "Block" columns 
(SI 705). 

[0067] To update the link table 60. an invalid value (XX) is inserted in the "Fbnward" column of an open entry line as 
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the forward pointer value (FPV), and the entry number for the cun-ent top entry is inserted in the "Backward" column of 
the open entry line as the backward pointer value (BPV). Then the cache management section 26 changes the Ibnward 
pointer value of the former top entry with the entry number of the new top entry (SI 706). For example, if the newly reg- 
istered entry number is 4 as shown in the link table 60 of FIG. 15, the cache management section 26 changes the for- 

5 ward pointer value off entry line 4 to an invalid value (XX) and the backward pointer value to 3, which is the former top 
entry number. The fonward pointer value of entry line 3 is also changed to 4, which is the new top entry. The pointer table 
58 (shown in FIG. 14) is updated by replacing the value in the Top" column with the entry line number (4) of the new 
entry (SI 707). Lastly, the cache management section 26 instructs the disk devices 32 to transfer all the data in the par- 
ity group of the target data to the cache storage addresses indicated in the "Block" columns of the entry table 62 

10 (SI 708). This completes the process for registering a target data into the cache 22 by registering the entire parity group 
in the cache 22 (SI 709). 

[0068] If at step (SI 702), no blank entry line is available, the oklest or bottom entry is expelled from the cache 22 to 
make room for the entry of the target data Into the cache (S1710). ft should be noted that while the link table 20 is 
described above as being updated before the data in the parity group are actually registered or stored into the cache 
15 22, the link table may also be updated after the registration of the parity group is confirmed. 

[00691 While the principles of the invention have been described above in connection with specific apparatus and 
applications, it is to be understood that this description is made only by way of example and not as a limitation on the 
scope of the invention. 

[0070] The present invention provkles a disk control device operationally connected to a high^rder device for oper- 
20 aling a plurality of disk devices, and having a cache for storing a plurality of data blocks Including a plurality of parity 
data from a corresponding plurality of parity groups in the plurality of disk devk;es, said cfisk control device comprising: 

a cache management table for indicating positions of the data blocks stored in the cache; 
cache management means for transferring a selected data btock requested by the high-order device from the plu- 
25 rality of disk devfoes to the cache when said cache management table indicates that said selected data block is not 
stored in the cache; and 

control means for transfemng said selected data block from said cache to said high-onjer device when said cache 
management means informs said control means that said selected data block is stored In the cache; 
wherein said cache management means holds a selected parity data in the cache when at least one data block 
30 betanging to the same one of the plurality of parity groups as said selected parity data is stored in the cache, and 
expels said selected parity data from the cache when said at least one data block belonging to the same one of the 
plurality of parity groups as said selected parity data is not stored in the cache. 

Claims 

35 

1. A disk control device (20) operationally connected to a high-order device (30) for operating a plurality of disk 
devices (32), and having a cache (22) for storing a plurality of data blocks (34; LB-0 - LB-19) and associated parity 
data (PC to P4) from a plurality of parity groups (LB-0 to LB-3. PO; LB-4 to LB-7. PI ; LB-8 to LB-1 1 , P2; LB-12 to 
LB-1 5. P3; LB-1 6 to LB-1 9, P4) in the plurality of disk devices (32). said disk control device comprising: 

40 

a cache management table (24. 52, 54. 56; 24. 58, 60. 62) for indicating posHions of the data blocks stored In 
the cache (22); cache management means (26) for transferring a 

selected data block requested by the high-order device (30) from the plurality of disk devices (32) to the cache 
(22) when said cache management table (24. 52, 54, 56; 24, 58, 60, 62) indicates that said selected data block 
45 is not stored in the cache (22) ; and 

control means (28) for transferring said selected data block from saki cache (22) to saki high-order device (30) 
when said cache management means (26) Informs ssad control means (28) that said selected data block is 
stored in the cache (22); 

wherein said cache management means (26) holds selected parity data (PO; PI; P2; P3; P4) in the cache (22) 
50 when at least one data block (LB-0 to LB-3; LB-4 to LB-7; LB-8 to LB-11; LB-12 to LB-15; LB-16 to LB-19) 

belonging to the same one of the plurality of parity groups (LB-0 to LB-3, PO; LB-4 to LB-7, PI ; LB-8 to LB-1 1 , 
P2; LB-12 to LB-15. P3; LB.16 to LB-19. P4) as said selected parity data is stored in the cache (22), and expels 
said selected parity data from the cache when said at least one data block belonging to the same one of the 
plurality of parity groups as saki selected parity data is not stored in the cache (22). 

55 

2. A disk control device as claimed in claim 1 , wherein said cache management means (26) includes group registra- 
tion judgment means (42) for accessing the cache management table (24) to determine whether said at least one 
date btock (LB-0 to LB-3; LB-4 to LB-7; LB-8 to LB-1 1 ; LB-12 to LB-15; LB-16 to LB-19) belonging to the same one 
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of the plurality of parity groups (LB-0 to LB-3, PO; LB-4 to LB-7, PI ; LB-8 to LB-1 1 , P2; LB-12 to LB-15. P3; LB-16 
to LB-19. P4) as said selected parity data (PO; PI : P2; P3: P4) exists in the cache (22). 

The control device as claimed in claim 1 or 2, wherein said cache management means (26) includes means (40) 
for determining whether said selected parity data (PO; P1 ; P2; P3; P4) exists in the cache (22). 

A disl^ control device as claimed in claim 1 . 2 or 3. wherein said cache management section (26) includes LRU con- 
trol means (48) for determining the priority of the data blocks stored in the cache (22) based on a relative order in 
which the data blocKs are accessed by said cache management means (26). 

A disk control device as claimed in claim 1 , 2, 3 or 4, wherein said cache management section (26) includes data 
expulsion means (44) for holding said selected parity data in the cache (22) when at least one data block belonging 
to the same one of the plurality of parity groups as said selected parity data is stored in the cache (22), and expel- 
ling said selected parity data from the cache (22) when said at least one data block belonging to the same one of 
the plurality of parity groups as said selected parity data is not stored in the cache (22). 

A disk control device as claimed in claim 1 , 2, 3, 4 or 5, wherein said cache management section (26) includes data 
registration means (46) for transfemng said selected parity data into the cache (22) a first time one of a plurality of 
data blocks in the same one of the plurality of parity groups as said selected parity data is transferred into the cache 
(22). 

A disk control device as claimed in claim 1 , 2. 3, 4. 5 or 6, wherein said cache management section (26) includes 
data registration means (46) for transferring, as a unit, the parity group of said selected data block including corre- 
sponding parity data of said parity group, when said selected data block is requested by said high-order device. 
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